Fast Incremental SVDD Learning Algorithm with the Gaussian Kernel
نویسندگان
چکیده
Support vector data description (SVDD) is a machine learning technique that is used for single-class classification and outlier detection. The idea of SVDD is to find a set of support vectors that defines a boundary around data. When dealing with online or large data, existing batch SVDD methods have to be rerun in each iteration. We propose an incremental learning algorithm for SVDD that uses the Gaussian kernel. This algorithm builds on the observation that all support vectors on the boundary have the same distance to the center of sphere in a higher-dimensional feature space as mapped by the Gaussian kernel function. Each iteration involves only the existing support vectors and the new data point. Moreover, the algorithm is based solely on matrix manipulations; the support vectors and their corresponding Lagrange multiplier αi’s are automatically selected and determined in each iteration. It can be seen that the complexity of our algorithm in each iteration is only O(k), where k is the number of support vectors. Experimental results on some real data sets indicate that FISVDD demonstrates significant gains in efficiency with almost no loss in either outlier detection accuracy or objective function value.
منابع مشابه
Incremental Learning Algorithm for Support Vector Data Description
Support vector data description (SVDD) has become a very attractive kernel method due to its good results in many novelty detection problems.Training SVDD involves solving a constrained convex quadratic programming,which requires large memory and enormous amounts of training time for large-scale data set.In this paper,we analyze the possible changes of support vector set after new samples are a...
متن کاملThe Mean and Median Criterion for Automatic Kernel Bandwidth Selection for Support Vector Data Description
Support vector data description (SVDD) is a popular technique for detecting anomalies. The SVDD classifier partitions the whole space into an inlier region, which consists of the region near the training data, and an outlier region, which consists of points away from the training data. The computation of the SVDD classifier requires a kernel function, and the Gaussian kernel is a common choice ...
متن کاملAn Algorithm of Unsupervised Posture Clustering and Modeling Based on GMM and EM Estimation
Support vector data description (SVDD) has become a very attractive kernel method due to its good results in many novelty detection problems.Training SVDD involves solving a constrained convex quadratic programming,which requires large memory and enormous amounts of training time for large-scale data set.In this paper,we analyze the possible changes of support vector set after new samples are a...
متن کاملEllipse Support Vector Data Description
This paper presents a novel Boundary-based approach in one-class classification that is inspired by support vector data description (SVDD). The SVDD is a popular kernel method which tries to fit a hypersphere around the target objects and of course more precise boundary is relied on selecting proper parameters for the kernel functions. Even with a flexible Gaussian kernel function, the SVDD cou...
متن کاملUnsupervised Nonlinear Feature Extraction Method and Its Effects on Target Detection in High-dimensional Data
The principal component analysis (PCA) is one of the most effective unsupervised techniques for feature extraction. To extract higher order properties of data, researchers extended PCA to kernel PCA (KPCA) by means of kernel machines. In this paper, KPCA is applied as a feature extraction procedure to dimension reduction for target detection as a preprocessing on hyperspectral images. Then the ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1709.00139 شماره
صفحات -
تاریخ انتشار 2017